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Summary 



For estimating a lower bounded parametric function in the framework of Marchand and Strawderman 

00; 

(2006), we provide "through" a unified approach a class of Bayesian confidence intervals with credibility 

^— > ■ 

1 — a and frequentist coverage probability bounded below by In cases where the underlying pivotal 
distribution is symmetric, the findings represent extensions with respect to the specification of the credible 
>" set achieved through the choice of a spending function, and including Marchand and Strawderman's HPD 

00 : 

procedure result. More significantly for non-symmetric cases, the lower bound finding, although not 

O ' 

[ applicable to the HPD procedure, is novel. Several examples are presented demonstrating wide applicability. 
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1. Introduction 



Bayesian credible sets are not designed (e.g., Robert, 2011) and are far from guaranteed (Fraser, 
2011) to have satisfactory, exact or precise frequentist coverage but it is nevertheless of interest 
to investigate (Wasserman, 2011) to what extent there is convergence or divergence in various 
situations, including the choice of the prior and of the credible set. A historically resonating 
example where there is exact convergence arises for estimating the mean of a N(/i, a 2 ) distribution, 
and where the use of the non- informative prior leads to a (1 — a) x 100% HPD credible set (i.e. the z 
or t confidence interval) with exact frequentist coverage. This, however, is very much the exception. 
Even, in the simple presence of a lower bound on the mean parameter /z (e.g., Mandelkern, 2002), 
with the prior taken to be the truncation of the non-informative prior onto the restricted parameter 
space, the frequentist coverage of the (1 — a) x 100% HPD credible set fluctuates from its credibility 
(or nominal coverage) 1 — a. However, the HPD procedure does not fare poorly as a frequentist 
procedure for large 1 — a as witnessed by the lower bound on its frequentist coverage due to 
Roe and Woodroofe (2000, known a 2 ) and Zhang and Woodroofe (2003, unknown a 2 ), as well as 
the better lower bound 1 — ^ (for a < 1/3, known a 2 ) obtained by Marchand et al. (2008). 

Marchand and Strawderman (2006) showed that the lower bound is applicable in a vast number 
of situations. Their findings relate to the following setup. 

Assumption 1. We have a model density f(x;9); x G X, 9 e 6; for an observable X, with 
both X and 6 being vectors, and we seek to estimate a parametric function t{6) (W — > $1) with 
the constraint t{6) > 0. We assume there exists a pivot T(X,9) = ai ^|~^ e - > ; a2(-) > 0; such 
that —T(X,9) has cdf G and Lebesgue density g. We consider prior measures n and ir , where 
7r (6>) = 7t(0).T[o )O o)(t(0)) ; and n is the Haar right invariant measure. We further assume that the 
decision problem is invariant under a group Q of transformations and that the pivot satisfies the 
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invariance requirement T(x, 9) = T(gx, g6), for allx<EX,9<EQ,g<EG,g<ECj, with X , O, G, and 
G being isomorphic. Referring to Marchand and Strawderman (2006, Lemma 2 and Corollary 1) for 
further details, we recall a key feature of the above structure, which is that the posterior distribution 
under n ofT(x,9) coincides, for all x, with the frequentist distribution ofT(X,9) (which is free of 
9) and thus has cdfG. 

For symmetric and unimodal g, Marchand and Strawderman (2006) show that the (1 — a) x 100% 
HPD credible set has frequentist coverage C{9) greater than for all 9 such that t{9) > 0. 
For non-symmetric densities g, Marchand and Strawderman (2006) provide proof in some cases, 
and evidence in others, of satisfactory performance for the HPD credible set. More precisely, 
the proof of the validity of the lower bound is achieved with an additional assumption on 
the type of asymmetry (skewness) of the distribution of the pivot T(X,9), while the evidence is 
based on numerical evaluations of a theoretical and unexplicit lower bound for frequentist coverage 
(e.g., Example 2 of Marchand and Strawderman, 2006 concerning a lower bounded Gamma scale 
parameter). Nevertheless, a clear result or lower bound for frequentist coverage in such cases is 
lacking, and it is our motivation here to try to fill this gap. 

Indeed, the main finding here is as follows. For prior measure tc , we obtain a class of (1 — a) x 100% 
credible sets for t(9) for which 

1 - a 

C{9) > , for all 9 such that r(9) > 0. (1) 

1 + a 

This is achieved irrespectively of G and is thus totally general in the context described above. Our 
class of credible sets includes an "equal-tailed modification" of the HPD procedure which we describe 
below, and which coincides with the HPD procedure whenever g is symmetric and unimodal. Hence, 
our findings as applied to the equal tailed procedure extends those of the symmetric case. 

It is by considering a class of Bayesian credible sets, rather than only the HPD procedure, that 



we are able to establish clear results on frequentist coverage. It is indeed with the analysis of the 
spending function related to credible set that we can pinpoint conditions for which the lower bound 
([1]) holds. The main findings are elaborated upon in Section 2 and various applications are presented 
in Section 3. To facilitate the presentation of the results, here is a list of definitions and notations 
used. 

Cheklist 

1 — a: credibility or posterior coverage or nominal frequentist coverage (a G (0, 1)) 
T(X,e) = a -^M: pivot 

7r: unrestricted prior density chosen as the right Haar invariant measure 

7r : prior density given by the truncation of 7r onto the restricted parameter space 

G: cumulative distribution function (cdf) of — T(X, 9)\x and of —T(X,6)\6 under it (which 
coincide for all x, 9) 

g = G'\ probability density function (pdf) of —T(X, 9) 
G^ 1 : inverse cdf 
ct(-): spending function 

I n0: a(.)(X) = [l(X),u(X)]\ Bayesian credible set of credibility 1 — a associated with the prior 
7r and the spending function a(-) 

C{9): the frequentist coverage at 9 of the confidence interval / 7roQ (.)(X) given by C{9) = 
Pe(Ua(.)(X) 3 t(9)) 

yo = -G-\^- a ) 

*(*) = Si 

A (x) = (l-a)(l-G{-t(x))) 
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2. Main results 



We begin this section by describing two different, yet equivalent, and instructive approaches to 
constructing a credible set for r{9). 

(A) (Spending function approach) 

A (1 — a) x 100% credible interval for t{9) associated with prior tt can be generated by 
a spending function ct(-) : W — > [0,a], such that I^ (X) = [l(X),u(X)] □ with P no (r(9) > 
u(x)\x) = a(x) (and consequently P no (r(9) < l(x)\x) = a — a(x) ). More precisely, we have 
the following under Assmption [TJ 

Lemma 1. For a given spending function a(-) , wehavel a t.\(x) = ai(x)+a2{x)G~ 1 {G(— t(x))+ 
(a — a(x))(l — G(— t(x))} and u a {-){x) = a\(x) + a 2 (x)G~ 1 {l — a(x)(l — G(— t(x))}, with 

Proof. Since the posterior cdf of —T(X, 9) under tt is given by G, we have by definition of 
tto, for y > 0: 

P (t(9) >v\x) 1-G(^¥) 

P (t(6) > v\x) - 1 1 ' Vl ' - v ' (o) 

no11 ' ~ Vl ' ~ P«(t(9) > 0| x) ~ 1 - G(-t(x)) ■ 1 > 

Hence, we obtain for j3 G (0,1), y > 0, P 7T0 (r(9) > y\ x) — {3 <^ y = ai(x) + a2(x)G~ 1 {l — 
(3 + (3G(—t(x))}, and the result follows with the choices (3 = a(x) and j3 — 1 — (a — a(x)) for 
u(x) and l(x) respectively. □ 

Example 1. The HPD procedures studied by Marchand and Strawderman (2006) for symmet- 
ric and unimodal g are given by the bounds l{x) = max{0, ai(x)+a2{x)G~ 1 ( l ~ (yl ~°^ <yt<yX ^ )} and 
u{x) = a\{x) + a 2 (x) min{G _1 (l — aG(t(x))), G~ x ( 1Jr V--°¥^^ x )) ) j _ With these given bounds, 



2 we suppress the dependence on a(-) unless required 
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one may verify directly from (TJ|) that the corresponding spending function is equal to 

■ r a G(—t(x) 

mm{a '2 + 2(l-GH(o;))) } ' (3) 
with a(x) = a if and only if t(x) < — G~ 1 (j^) = G~ 1 (j^) since g is symmetric about 0. 
Conversely, applying LemmaU\with the spending function choice a(-) in (TJ|) leads to the HPD 
procedure above (using the equality of G(-) and 1 — G(—) for symmetric about g). 

(B) (Approach based on quantiles of the pivot) 

Alternatively, a second approach for cases where l(x) > begins with choices 71 and 72, which 
will be made for each x, such that (7(72) — G(— 71) = A, for a given A e (0, 1). Since, for any 
x, we require 1 — a = P 7ro (/(x) < r{6) < u(x)\x), we must have by (j2J): 

G (^-LM) _ G ( l{x) -, a l {x) ) = (1 " «)(1 " 
a 2 (x) a 2 (x) 

and this can be achieved with choices —71 and 72 above for A = A (x) = (l—a)(l — G(—t(x))) 
yielding = 7 2 (A (x)) and l(x) ~fy x) = -7i(A (x)), in other words 

l(x) = a 1 (x) - a 2 (x) 7i(A (x)), and u(x) = ai(x) + a 2 (x) 7 2 (A (x)) , (4) 

whenever l(x) > 0. In view of the lower bound restriction on t(9) (i.e., t{6) > 0), and the 
corresponding requirement that l(X) > 0, observe that not all choices of —71 (and hence of 
72) are feasible in (j3J) and that we must have 

- 7l (A (x))>-^M. 

a 2 {x) 

Example 2. With the above construction in Q) ; an equal-tailed choice of —71 and 72, that is 
—71(A) = G^ 1 (^-^-) and 72(A) = G _1 (ii4) ; leads to the credible set bounds 

l(x) = ai(x) + a 2 (x) G _1 ( ), and u(x) = a±(x) + a 2 (x) G~ 1 ( ), (5) 



when l(x) > 0. These above bounds coincide with those of the HPD procedure (when l(x) > 0) in 
the symmetric case of Example [21 as well as the spending function given in |3j) as can be verified 
directly from (Please note that the terminology "equal tails" does not mean a(x) = a/2 but 
rather to the choice of the quantiles —71 and 72 under G.) 

We define yo = — G~ 1 (j^) and t{x) = ^M- Observing that l{x) < in (jSJ) if and only if t{x) < yo, 
we are lead to investigate the frequentist performance of Bayesian credible intervals which have 
l(x) = 0, or equivalently a(x) = a whenever t(x) < y , and which include the equal-tailed procedure 
of Example [2j We now have the following. 

Theorem 1. Under the conditions of Theorem 1 of Marchand and Strawderman (2006), that is 
Assumption^ consider Bayesian credible intervals Itt ,oi{-) associated with prior 7r and a spending 
function a(-) such that a(x) = a for all x with t(x) < y^. For the frequentist coverage C(0) = 
Pe{.I-K ,a{-){.X) 3 t(9)), we then have 

(a) C{0) = ^ for all 9 such that t{6) = 0; 

(b) Moreover, we have C{9) > for all 9 such that t{9) > as long as a(x) satisfies, for all x, 

(1 - a) «,)) + £ £ £ _g_ 1 



l-G(-t(x)) - w - v l + al - G(-t(x)) 
Proof. 

(a) For 9 such that t{9) = 0, we have 

P e {h M {X) 9 0) = P e (a(X) = 0) = < y ) = 1 - G(-y ) = 

since — has cdf G whenever r{6) = 0. 
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(b) As in Marchand and Strawderman (2006), first observe that the confidence interval h{X) = 
[h(X), Ul (X)} = max{0,ai(X) + a 2 (X)G- 1 ( J f^)} } a l (X) + a 2 (X)G- 1 ( J ^)} has the same 
frequentist coverage as I*(X) = [ai(X) + a 2 (X)G~ 1 (j^)}, ai(X) + a 2 (X)G~ 1 (j^)}} equal 
to Pe(G-\^) < iM^p < G-\^-J) = GCG-Hife)) " G(G-\^)) = £f . Now, we 
show that the given conditions on a(-) imply that I m a (.) ^ l\ which will lead to the result 
directly. Indeed, we have by the upper bound in and Lemma [TJ u a ^(x) > ai(x) + 
a 2 (x)G~ 1 (l — jt^) = U\(x). Similarly, from the lower bound ([6]) and Lemma U we obtain 
l(x) < ai (x) + a 2 (x)G- 1 {G(-t(x) + ail - G(-t(x))) - ^ - (1 - a)G{-t{x))} = a x {x) + 
a 2 {x)G- 1 {^) = h{x). □ 

Corollary 1. Under Assumption^ the equal-tails credible interval I wo>a (.) defined by the spending 
function a(-) in |3|) has minimum frequentist coverage C(8) greater than for all 9 such that 
t(0) > 0. 

Proof. It suffices to show directly that (jSJ) is satisfied for the selection a(x) = <y e qi(x) given in (j3]) 
for x such that t(x) > yo. Indeed, we have for such x's: 

«eqt(*)(l " G(-t(x))) = | + G(-t(x)) < f + ^ G(-yo) = ^ , 

and 

a eqt (x)(l-G(-t(x)))-(l-a)G(-t(x))-^- = - ^ G(-t(x)) 

ail — a) 1 — a 

Remark 1. In cases where the underlying pivotal distribution is non-symmetric, CorollaryU\ is a 
new result, generalizing Theorem 1 of Marchand and Strawderman ( 2006 ), by virtue of Example 
U\ and is widely applicable given the lack of assumptions on g. Also, the bounds of the equal tails 
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procedure are easier to evaluate than that of the HPD credible interval. And the findings of Theorem 
U\ go beyond a single procedure, even in the symmetric case, by providing a class of credible sets, as 
specified by a spending function, with frequentist coverage bounded below by . 

3. Examples 

At the risk of some redundancy with the examples provided by Marchand and Strawderman (2006), 
it is still beneficial here to present various applications with accompanying commentary. Assumption 
1 is satisfied in all of the examples below with the underlying family of transformations (distribu- 
tions) being either the location family, the scale family, or the location-scale family. 

(A) (a single location parameter) X ~ f (x - 9); t{9) = 9 > 0; T(X,6) = X - 6; ir (9) = 
l[o,oo)(6 1 )- In such cases, all Bayes credible sets I^oM-) (with credibility 1— at), with the spending 
function a(-) satisfying the conditions of Theorem [1] and the bounds in ([6]), have necessarily 
minimum frequentist coverage bounded below by Through the transformations X — > 

X — a and X — > —X + a, one can reduce all lower bounded restrictions 9 > a and upper 
bounded restrictions 9 < a to the case 9 > considered here and we will not make further 
explicit mention of such transformations below. 

Remark 2. Results such as those in (A) are applicable as well for several observations by condition- 
ing on a maximal invariant. Indeed, suppose that X = {X\, . . . , X n ) ~ fo(x\ — 9, . . . , x n — 9), where 
/o is known and the Xi 's are not necessarily independently distributed. Set V = (X2 — X±,..., X n — 
Xi) as a maximal invariant. One can then proceed, for a given value v of V, with an interval 
estimate J 7ro . Q ,(. )1 ,)(Xi, v ) as given in LemmaUl with G = G v representing the cdf of the pivot X\ — 9 
conditional on V = v, and a(x,v) satisfying the conditions of TheoremUl and (E|). In such a case, 
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Theorem^ applies to the conditional frequentist coverage C(8,v) = Pg(I n0jOl r. jV -)(X,v) 3 t(9)\V = v) 
yielding the inequality C(9,v) > for all 9 > 0. Since this is true for all v, the unconditional 
frequentist coverage C{9) of the Bayes credible set In 0t a(;-)(X, V) will also exceed for all 9 > 
(see Marchand and Strawderman, 2006, for more details related to a multivariate Student model) 
In the same vein, all the scenarios below (B to G), although presented for simplicity in the single 
observation case, are also applicable in presence of a sample by conditioning on a maximal invariant. 

(B) (a lower bounded scale parameter) X ~ |/i(f) 1(q,oo)0e) with 9 > a; r(9) = log(#) — 
log(a) > 0; T(X, 9) = log(X) — log(a) —r(9); n (9) = |l[ 00 )(r(6 1 )). Here, an interval estimate 
of t(9) provides an interval estimate of 9. Important models include Gamma, Weibull, Fisher, 
among others. A familiar setup where the results can be applied arises in random effects 
analysis of variance models with a Fisher distributed pivot (see Zhang and Woodroofe, 2002, 
for details). 

Remark 3. Further applications consist of power parameter families where we have a scale family 
for an observable Y and the model of interest are the distributions for X = e Y . As as simple illus- 
tration, consider the Pareto model for X with densities ^rl(i,oo)(^) and the parametric constraint 
7 G (1, 70). In such cases, we have that 7 log(X) ~ Exp{9) with 9 = ^ > 1 and the results in (B) 
apply. 

(C) (location-scale families ) (X h X 2 ) ~ / (^, f ) l(o,oo)M; r(0) = 9 1 > 0; T(X,9) = 

, txq{9) = ^l(o,oo)(6 l 2)l[o,oo)(6 l i)- This setup encompasses the basic normal case: Xi, . . . X n 
ind. N(/x, a 2 ) with a 2 unknown and fi > 0, and more generally linear models Y = Z(3 + e, e ~ 
N(0, a 2 I n ) where the objective is to estimate a lower-bounded linear combination r(9) = I' (3, 
by setting X x = j3{Z'Zy 1 Z'Y, X 2 = \\Y - Zf3\\ 2 , 9 X =P,9 2 = a. Here, the pivot T(X, 9) has 
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a Student distribution. Alternatively, if the objective is to estimate a lower bounded scale 62, 
one can proceed as in (B). 

(D) (linear combination of several location parameters) X = (Xi,...,X p ) ~ /o(^i — 
1 ,...,x p -0 p ) ] t{6) = YJUia-iOi-MO) = l [Ol oo)(r(0)), T(X,0) = (E?=i ^X,) - r(0). This 
setup includes, for instance, estimating a difference 9\ — 9 2 with an order constraint 9± > 9 2 . 

(E) (multivariate location-scale families with homogeneous scale ) 

In (D), we can incorporate a common scale and apply the results of this paper for estimating a 
lower bounded linear combination with X = (X ± , . . . , X p , X p+1 ) ~ /o( ^~ , ■ ■ ■ , ^g~^ f 2 ^")' 
r{6) = EUaA, T(X,0) = ^g^ , and n (9) = ^W^+i)Wr(£)). 

(F) (several scale parameters ) 

(X u . . . , X p ) ~ (n? =1 i) . . . , g); r(0) = E?=i ^ log(^). This can consist, for instance, 

of estimating a lower bounded ratio |^ > a of two scale parameters. 

(G) (quantiles in location-scale families) X, L ~ N(fi,a) iid; t(9) = /i + 770- > 0, 7t (/x,(t) = 
J 1 (0,00) (o-)l (0,00) (A* + W)- = Here, T(X,9) is distributed as non-central 
Student. The applications are not restricted to normality and are applicable in general for 
location-scale families as in (C). 



Concluding remarks 

For a large variety of situations with a lower bounded parametric constraint, we have obtained 
a class of Bayesian (1 — a) x 100% credible sets which provide minimal frequentist probability 
coverage exceeding j-^. These Bayesian confidence intervals includes an equal tailed modification 
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or approximation of the HPD credible set which coincides with the latter when the distribution 
of the underlying pivot is symmetric. We have made use of the spending function interpretation 
of Bayesian confidence intervals. In seeking to evaluate the frequentist performance of Bayesian 
confidence intervals, our results illustrate that the choice of bounds matters, so that there does 
not necessarily exist a single universal assessment of their frequentist performance even in a given 
specific problem. 
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